Intelligent Key Prediction by N-grams and Error-correction Rules
نویسندگان
چکیده
In this paper, we propose an intelligent aid for text input method in order to provide an easier way for text inputting with the conventional keyboard. We use the character n-gram model and error correction rules to identify the language being typed and to predict the most probable character string without extra keystroke. The character n-gram model is also required only a small amount of memory spaces when using Bi-gram and Tri-gram. The paper also proposes rule-reduction algorithm applying mutual information to reduce the error-correction rules. Our algorithm archives more than 99% accuracy in both language identification and key prediction.
منابع مشابه
Towards an Intelligent Multilingual Keyboard System
This paper proposes a practical approach employing n-gram models and error-correction rules for Thai key prediction and Thai-English language identification. The paper also proposes rule-reduction algorithm applying mutual information to reduce the errorcorrection rules. Our algorithm reported more than 99% accuracy in both language identification and key prediction.
متن کاملThe StringNet Lexico-Grammatical Knowledgebase and its Applications
This demo introduces a suite of web-based English lexical knowledge resources, called StringNet and StringNet Navigator (http://nav.stringnet.org), designed to provide access to the immense territory of multiword expressions that falls between what the lexical entries encode in lexicons on the one hand and what productive grammar rules cover on the other. StringNet’s content consists of 1.6 bil...
متن کاملA Statistical Approach to Automatic OCR Error Correction in Context
This paper describes an automatic, context-sensitive, word-error correction system based on statistical language modeling (SLM) as applied to optical character recognition (OCR) postprocessing. The system exploits information from multiple sources, including letter n-grams, character confusion probabilities, and word-bigram probabilities. Letter n-grams are used to index the words in the lexico...
متن کاملDeveloping an Unsupervised Grammar Checker for Filipino Using Hybrid N-grams as Grammar Rules
This study focuses on using hybrid n-grams as grammar rules for detecting grammatical errors and providing corrections in Filipino. These grammar rules are derived from grammatically-correct and tagged texts which are made up of part-of-speech (POS) tags, lemmas, and surface words sequences. Due to the structure of the rules used by this system, it presents an opportunity to have an unsupervise...
متن کاملGrammatical Error Correction of English as Foreign Language Learners
This study aimed to discover the insight of error correction by implementing two correction systems on three Iranian university students. The three students were invited to write four in-class essays throughout the semester, in which their verb errors and individual-selected errors were corrected using the Code Correction System and the Individual Correction System. At the end of the study, the...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001